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Abstract. We call a CNF formula linear if any two clauses have at most one 
variable in common. Let Linear fc-SAT be the problem of deciding whether a given 
linear fc-CNF formula is satisfiable. Here, a fc-CNF formula is a CNF formula in 
which every clause has size exactly fc. It was known that for fc > 3, Linear fc-SAT 
is NP-complete if and only if an unsatisfiable linear fc-CNF formula exists, and 
that they do exist for k < 4. We prove that unsatisfiable linear fc-CNF formulas 
exist for every k. Let /(fc) be the minimum number of clauses in an unsatisfiable 
linear fc-CNF formula. We show that /(fc) G fi(fc2 fe )nO(4 fc fe 4 ), i.e., minimum size 
unsatisfiable linear fc-CNF formulas are significantly larger than minimum size 
unsatisfiable fc-CNF formulas. Finally, we prove that, surprisingly, linear fc-CNF 
formulas do not allow for a larger fraction of clauses to be satisfied than general 
fc-CNF formulas. 



1 Introduction 

A CNF formula F (conjunctive normal form) over a variable set V is a set of clauses; a 
clause is a set of literals; a literal is either a variable if For its negation x. A CNF 
formula F, or short, a CNF F, is called a fc-CNF if \C\ = fc for every C G F. Define 
vbl(a;) = vbl(z) := x for x G V, vbl(C) := {vbl(Z) \l € C} and vbl(F) := U Ce F vbl (C). 
For example, vbl({x, y,z}) — {x,y,z}. A (partial) assignment a is a (partial) function 
V — ► {0, 1}. It can be extended to negated variables by a(x) :— -^a(x). A clause is 
satisfied by a if at least one literal in it evaluates to 1, and a formula is satisfied if every 
clause is satisfied. Applying a partial assignment a means removing from F every clause 
satisfied by a, and from the remaining clauses removing all literals evaluating to 0. The 
resulting formula is denoted by F^ . 



Consider a set system S of sets of cardinality k over some ground set V, i.e. a k- 
uniform hypergraph. We say S is a fc-set system. We call S linear if \A D B\ < 1 for any 
A 5^ B. We do not use any deep results from hypergraph theory in this paper. 
Nevertheless, for definitions and basic terminology of hypergraphs we refer the reader 
to [T] or 0. 

A CNF F is linear if |vbl(C) D vbl(£>)| < 1 for all clauses C,D £ F, C ^ D. The set 
system {vbl(C), C G F} is called the skeleton of F. If F is a fc-CNF, then its skeleton is 
a fc-uniform hypergraph, which is linear if F is linear. Note that the converse does not 
hold in general: The formula {{x, y}, {x, y}} is not linear, but its skeleton is {{x,y}}, 
thus linear. 



Examples: The formula {{xi, X2}, {X2, £3}, {£3, X4}, {X4, x±}} is a linear 2-CNF, 
whereas {{xi, X2, £3}, {X2 £3, £4}} is not linear. 
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Previous Results 



Let fc-SAT be the problem of deciding whether a given fc-CNF is satisfiable. It is well- 
known that fc-SAT is NP-complete for fc > 3. Define Linear fc-SAT to be the corre- 
sponding decision problem for linear fc-CNFs. Porschen, Speckenmeyer and Randerath [3] 
observed that Linear fc-SAT is NP-complete if and only if there exists an unsatisfiable 
linear fc-CNF. They proved the existence of unsatisfiable linear fc-CNFs for fc = 2, 3. 
In [4], Porschen, Speckenmeyer and Zhao prove existence for k = 4. Up to now, for fc > 5 
the question whether unsatisfiable fc-CNFs exist has been open. 



Our Contribution 



We show that unsatisfiable linear fc-CNFs exist for any fc, hence establishing NP-completeness 
of Linear fc-SAT for all fc > 3. Further, let /(fc) denote the size of a smallest unsatisfiable 
linear fc-CNF. We prove that /(fc) 6 0(fc 4 4 fc ) and, using the Lovasz Local Lemma, show 
that /(fc) € J?(fc2 fc ). This is in contrast to the general (non-linear) case, where we know 
that unsatisfiable fc-CNFs with 2 k clauses exist. 



Having established /(fc) e 0(fc 4 4 fe ), we are still looking for explicit constructions of 
unsatisfiable linear fc-CNFs. We give a construction using < t(k) clauses, for t(0) := 1 
and t(k + 1) := t(k)2 t ^ k \ i.e., a tower-like function. Compared to the gigantic growth of 
t(k), even fc 4 4 fc seems very modest. 



2 Preliminaries 



Denote by L(n, fc) the maximum number of sets a linear fc-set system over n elements can 
have. In this section, we give some bounds on L(n, fc). Everything in this section is stan- 
dard graph and hypergraph theory. The following upper bound is an easy observation. 
See Theorem 3 in Chapter 1 of [2] for example. 



Lemma 2.1. 

n(n — 1) 



L(n,k) < 



k(k-l) 



Proof. Let 5 be a linear fc-system over n elements. There are Q) pairs of elements, 
and each fc-set in S contains (2) pairs. Since each pair is present in at most one set, we 
obtain \S\ < Q)/ (2)- " D 

If this upper bound is achieved, then every pair of elements occurs in exactly one set, 
and the set system S is also called a Steiner system. For existence of Stciner systems 
for specific values of n and fc see for example [5]. At this point, we only give a proof of 
existence of Steiner systems for fc being a prime power. 

Lemma 2.2. For every prime power k, there are infinitely many n such that 



Proof. Let fc be any prime power, and let be the finite field of cardinality fc. Let 
be the ci-dimensional vector space over Ffc. It has n = |F^| = k d elements, called 
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points. For x,y €¥f and y ^ 0, the set {x + Xy | A G F^} is called a line. A line contains 
exactly k points, and the vector space has 

(2) = - !) 
(*) - 1) 

lines: Every pair of distinct points a and b lies on exactly one line, namely {a+\(b— a) | A € 
Ffc}, and each line can contains ( 2 ) pairs of distinct points. Note that two lines intersect 
in at most one point. Let S be the fc-set system of all lines in Wf. Then S is a linear set 
system over n points, and 15*1 = -^j^rjy- D 

If k is not a prime power, we have the following weaker bound on L{n, k). 
Lemma 2.3. For any n, k G N, 

T , , . 2n(n - 1) 



Proof. Recall that any simple graph G on n vertices with maximum degree A has an 
independent set / C V with \I\ > -^j- This follows from a greedy construction: As long 
as G is not empty, pick a vertex and insert it into /. Remove it and all its < A neigh- 
bors. In every step, < A+l vertices are removed, hence we add at least vertices to /. 

For n, k G N, define a graph as follows: The vertices of the graph are all (^) /c-sets 
over n elements, and two sets are connected by an edge if they share more than one 
element. Each independent set of the graph corresponds to a linear fc-set systems over 
these n elements. We estimate the maximum degree of this graph. Let s be a fc-set. 
How many sets share two or more elements with s? There are possibilities to fix 2 
elements to be included in the neighbor set s', and (Z-2) possibilities to choose the rest. 
Of course, this will overcount the number of such sets. Hence there are at most ( 2 ) (uZ^) 
sets sharing two or more elements with s. Since s itself is counted among those, we have 
A + 1 < (2) (IZl) ■ The graph itself has (™) vertices, hence 



L(n,k) > 



(2) 



and the lemma follows from a simple calculation. □ 



3 Unsatisfiable fc-CNFs formulas and NP-hardness 

In this section, we will prove existence of unsatisfiable fc-CNFs for any fc, as well as 
proving some upper bounds on the number of clauses in such a formula. Porschen, 
Speckenmeyer and Randerath [3] already stated that for k > 3, Linear fc-SAT is NP- 
hard if there exists an unsatisfiable linear fc-CNF. To keep this paper self-contained, we 
include a proof of this result. 

Theorem 3.1 (Porschen, Speckenmeyer and Randerath |3j). For any k > 3, 

Linear k-SAT is NP-complete if there exists an unsatisfiable linear k-CNF. 
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Proof. We reduce fc-SAT to Linear fc-SAT. Since fc-SAT is NP-complete for k > 3, 
this will prove the theorem. Let F be a fc-CNF. We transform it to a linear fc-CNF 
F' such that F is satisfiable iff F' is. Let F have m clauses and n variables. For 
a variable x let d(x) denote the number of times x appears in F. Replace each x 
by d(x) new variables x\,... ,Xm x \. To ensure that F is satisfiable iff F' is, we force 
these variables to take on the same truth value by adding d(x) implication clauses 
{xi,x 2 },{x2,x s }, . . . ,{x d (x)-i,Xd(x)-i},{xd(x),%i}- Clearly the new formula F' is lin- 
ear, and it is satisfiable iff F is. However, F' is not a fc-CNF. We remedy this by adding 
fc — 2 new variables to each implication clause and forcing each of them to by adding 
a forcer. A y-forcer is a linear fc-CNF which is satisfiable iff y is set to 0. Such a for- 
mula can be obtained by taking any minimal unsatisfiable linear fc-CNF formula G y with 
y G vbl(G) and removing from G all clauses containing y. Adding a y-forcer to F' for 
each variable y we added to the implication clauses guarantees that F 1 is satisfiable iff 
F is. F' is a linear fc-CNF, and the proof is complete. □ 

3.1 Existence of Unsatisfiable Linear fc-CNFs 

We will complete the NP-completeness proof of Linear fc-SAT by showing that unsat- 
isfiable linear fc-CNFs exist, for any fc > 0. This answers the main open question from 
Porschen, Speckenmeyer and Randerath [3] and establishes the NP-completeness of Lin- 
ear fc-SAT for all fc > 3. 

Theorem 3.2. For any k G No, there are unsatisfiable linear k-CNFs. 

Proof. We prove this by induction on fc. For fc = 0, the formula F = {{}} con- 
taining only the empty clause is linear and unsatisfiable. For the induction step, let 
F = {Ci,...,C m } be an unsatisfiable linear fc-CNF. We will construct an unsatisfi- 
able linear (fc + 1)-CNF formula F'. Create m new variables Xi,... ,x m . For a clause 
D = {tii, ■ • • ! u m } with Ui G {xi, Xi}, define 

F ® D :— {d U {Ui} \ i = l,...,m} . 

F' is a linear (fc + 1)-CNF formula, and every assignment satisfying F ® D satisfies D. 
Create 2 m variable disjoint copies Fx, . . . , F 2 ™ of F, i.e., vbl(Fj) n vbl(i^ ) = for i ^ j. 
By choosing 2 m different sign patterns, we create 2 m distinct m-clauses D\, . . , , £>2>™ over 
the variables Xj. The formula {D\, . . . , £>2 m } is unsatisfiable. Hence 

2 m 

F' := |J Fi (8) A 

i=l 

is unsatisfiable, as well. Clearly, F 1 is a linear (fc + 1)-CNF. □ 

This proof constitutes an explicit construction, but note the gigantic growth of the 
size of the constructed formulas: Let t(k) denote the number of clauses of the fc-CNF 
formula generated in this construction. Then t(k + 1) = t(k)2 t ^- k \ so we have t(l) = 2, 
i(2) = 8, i(3) = 2048, t(4) = 20 48 x 2 2048 . Fortunately there is a much better upper 
bound, obtained by a probabilistic argument. 

Theorem 3.3. For every k G No, there exist an unsatisfiable k-CNF F with 

\F\ < fc 4 4 fe . 
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Proof. Fix any fc € No- Let V be a set of n variables, n to be specified later. Let S 
be a linear fc-set system over V and write to :— \S\. From each s£S, build a fc-clause by 
choosing uniformly at random one of the 2 k possible sign patterns. Do this independently 
for each s S S and obtain a linear fc-CNF F. Fix an assignment a. For every set s G S, 
the probability that the clause C built from s is satisfied by a is 1 — 2~ fc . Since the sign 
pattern of each clause is chosen independently, we obtain 

Pr (a satisfies F) = (l - 2~ fc ) m 

There are 2™ different truth assignments to V, thus the probability that at least one 
of them satisfies F can be estimated by the union bound: 

Pr (F is satisfiable ) < 2" (l - 2~ fc ) m 

If 2™ (l — 2~ fe )'" < 1, then there exists an unsatisfiable linear fc-CNF with to clauses 
and n variables. Since 1 + x < e x for all x ^ 0, we have 2™ (l - 2~ k ) m < e ™ ln2 - m2 k ; 
and 

e „ln2-m2-* < j ^ 
TO 

n In 2 r < 

2 fc - 

2 fe nln2<TO. (1) 

That is, if to > 2 fe nln2, then the random formula f 1 is unsatisfiable with positive 
probability. By Lemma l2.3l we know that there is a linear fc-set system S over n elements 
of size 

r ^ i ro^2~\ 

(2) 

Since to grows superlinearly in n, we see that for sufficiently large n the last inequality 
holds, which implies that there is an unsatisfiable linear fc-CNF of size to over n variables. 
To obtain an upper bound on n and to, plug ([2]) into {T]): 

2 fe nln2< =!L 
" fc 4 

ln2fc 4 2 fc 

Since we are interested in the order of growth for large fc rather than in constant factors, 
write 

m e ° (f) = ° {kHk) ■ 

Therefore, there is an unsatisfiable linear fc-CNF F over n € (fc 4 2 fc ) variables having 
to 6 (fc 4 4 fc ) clauses, and the theorem follows. □ 

This is the best upper bound we have. It is much better than the explicit construction 
of Theorem 13.21 but it is still far away from the best lower bound of Q(k2 k ). 

4 Partial Satisfaction in Linear 2-CNF Formulas 

It is well known that every unsatisfiable fc-CNF contains at least 2 k clauses. This bound 
is tight, since the fc-CNF Fk consisting of all 2 k clauses over some variable set V, \V\ = fc, 



2n(n 


-1)" 


> 


~2n 2 ~ 


k 2 (k - 


-1)2 




fc 4 
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is unsatisfiable. Further, for every fc-CNF, there exists an assignment satisfying at least 
(l-2" fe )|F| clauses. This can be seen by choosing a random assignment and calculating 
the expected number of satisfied clauses. This bound is also tight, as Fk demonstrates. 
This is interesting: The upper bound on the fraction of clauses one can always satisfy is 
achieved by a smallest unsatisfiable formula. Since unsatisfiable linear fc-CNFs are much 
larger than 2 k , as we will see, one might suspect that linear fc-CNFs are more amenable 
to partial satisfaction than general fc-CNFs, i.e., that for at least some fc, there is an 
rk > (1 — 2~ fe ) such that every linear fc-CNF F admits an assignment satisfying > rk\F\ 
of its clauses. However, this is not true: 

Theorem 4.1. For every fc G N and S > 0, there is a linear k-CNF Fk y s such that every 
assignment leaves at least fraction of (1 — 5)2~ k of all clauses unsatisfied. 



Proof. The proof is similar to the probabilistic proof of Theorem 13.31 Given fc, fix a 
linear set system S over ground set V. Let n :— \V\, m := \F\, which will be determined 
later. Fix an assignment a on V and build a random formula F over the skeleton S by 
randomly choosing the signs of the literals in every clause. Let F = {C\, . . . , C m } and 
define m random variables Xi by 



X,, = 



if a satisfies C\ 

1 otherwise. 



Define X := J2iLi x i- Observe that n := E[X,] = 2~ k and E[X] = 2- k m. We want 
to bound the probability that less that (1 — 8)2~ k m clauses are unsatisfied by a. First 
observe that the Xi are independently identically distributed binary random variables 
with expectation 2 . Therefore, X has a binomial distribution with expectation /i = 
2~ k m, and Chernoff 's inequality yields 

Pr{X < (l-S)fj,} < e-^ 
For a derivation of this inequality see e.g. [6]. Applying the union bound, we estimate 



Pr{3a leaving < (1 — <5)/i clauses unsatisfied } < 2 



and want last term to be smaller than 1. By Lemma [2.31 we can choose m > and 
calculate 

n In 2 - — < 
2 

^ = 2- fe -W>nln2 
2 

m > 2 fe+1 nln2 ( r 2 (3) 

^>2 fe+1 7iln2 ( 5- 2 
fc 4 

n > fc 4 2 fe ln2<T 2 (4) 

For every fixed fc and S > 0, we can make the last inequality true by choosing n suffi- 
ciently large. Therefore, there is a positive probability that the randomly chosen formula 
F does not have a truth assignment satisfying more than (1 — (1 — 8)2~ k )m clauses. □ 

By setting e = 62~ k , we see that there is a linear fc-CNF F in which no more than 
(l — 2~ fc + e)|F| clauses can be satisfied. Note that the proof of Theorem 14.11 is not 
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specific to linear CNFs. For a more general setting, call a property of formulas structural 
if it only depends on the skeleton of the formula, not on its signs. For a structural propery 
V, let ex-p(n, k) be the maximum number of clauses a fc-CNF over n variables having 
property V can have. 

Theorem 4.2. Let V be a structural property of CNFs. If for fixed k, ex-p(n, k) grows 
superlinearly in n, then for every e > 0, there exists a formula F e for which no truth 
assignment a satisfies more than (1 — 2~ k + e)|.F e | clauses. 



5 Lower Bounds 

After having established that f(k) G 0{k 4 2 k ), we want to obtain lower bounds on f(k). 
To be more precise, we show that f(k) G Q(k2 k ). We prove this by repeated application 
of the Lovasz Local Lemma. For a formula F, define the neighborhood of a clause C to 
be 

T(C) := {D G F | vbl(D) n vbl(C) ^ 0, C # £>} . 

It follows from the Local Lemma that a /c-CNF with \r(C)\ < \2 k for every clause C 
is satisfiable (the constant \ can be improved upon). Conversely, if F is unsatisfiable, it 
contains a clause C with a large neighborhood. We find a partial assignment a on vbl(C) 
that satisfies C and a large part of its neighborhood, say at least c2 k clauses, for some 
constant c. Since F is linear, applying a deletes at most one literal from any clause in 
r(C), hence F^ is a (fc — 1)-CNF. Here, we can again apply the Local Lemma and satisfy 
c2 fe ~ 1 clauses, and so on. Repeating k times, we have satisfied at least c(2 k + 2 k ~ x + 
• • ■ + 2 1 ) = c2 k+1 — 2c clauses. Unfortunately, this is not enough. We must somehow take 
advantage of the fact that though F^ contains (k — l)-clauses, the neighborhood of a 
clause in F^ cannot contain too many of them. 

Lemma 5.1 (Lovasz Local Lemma). Let Ai, . . . , A m be events in some probability 
space, and let G be a graph with vertices A\ , . . . , A m and edges E such that each Ai 
is mutually independent of all the events {Aj | {Ai,Aj} $ E, i ^ j}. If there exist real 
numbers < "fi < 1 for i — 1, . . . , m satisfying 

Pr(Ai) < 7i J] (1-7,0 

j:{Ai ,Aj}eE 

for all i — 1, . . . , m, then 

Pr(Ai U A 2 U ■■■UA m ) < 1 



For a proof of the Lovasz Local Lemma and different versions, see e.g. [?]• 

Lemma 5.2. Let F be a CNF not containing any clause of size < 1. If for any C G F 
it holds that 

y 2-\°\<- 

- 4 

Der(c) 

then F is satisfiable. 

Proof. This is an application of the Lovasz Local Lemma. Let the probability space 
be the set of all truth assignments to the n variables in F with the uniform distribution. 
Write F — {C\, . . . , C m } and let Ai be the event that a random assignments a does not 
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satisfy clause Ci. Let G be the graph where Ai and Aj are conntected if they have a 
variable in common, and let ji := 2 1_ ' c '' < 1. For each i = 1, . . . , m, we have 

7* n (1-7,) 

> Till" J2 Til 

V c 3 er(c,) / 

> 2-l c 'l=Pr(A,) 

Hence, by Lemma \5A\ the probability that a leaves some clause unsatisfied is < 1, and 
thus with positive probability, a satisfies F. Therefore, F is satisfiable. □ 



Definition 5.3. Let k be fixed. An [l,k]-CNF is a CNF with I < \C\ < k for every 
C G F. For an [I, k]-CNF F and a variable x, let 

d F (x) := \ {C e F | x £ vbl(C), \C\ < k - l}| 

J/ £/iere is no danger of confusion, we will simply write d(x). Further, let 

d(F) := max dp(x) 

a;evbl(F) 

Finally, define 

r'{C) := {D G r{C) I |D| < k - 1} 
r^(C) :={D6r'(C) | z 6 vbl(£>)}. 

Lemma 5.4. Let F &e an [I, k]-CNF. Then for any clause C G F 

\r\c)\<kd{F) 



Proof. We simply calculate 



\r\c)\ = 



U r ^°) 

xGvbl(C) 



< E \r' x (C)\<\c\d F {x)<kd{F) . 

xevh\(C) 



and the lemma follows. 



□ 



We need a lemma that states that after setting the variables of C such that C and a 
large part of its neighborhood is satisfied, d{F) does not increase too much. 

Lemma 5.5. Let F be a linear [l,k]-CNF and C be any clause in F. Let a be any 
assignment that is defined only on the variables of C . If a satisfies C, then d(F^) < 
d(F) + k, and F^ is a linear [I - 1, k]-CNF. 
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Proof. Since F is linear, C is the only clause containing more than one variable in 
the domain of a. Since a satisfies C, every clauses loses at most one literal, and hence 
F^ is an [/ — 1, fc]-CNF. Surely, it is linear as well. To bound the amount by which d(F) 
increases, consider any y G vbl(-F). If y is set by a, then d F [ a ] (y) = 0. Otherwise, dp[a\ (y) 
is at most plus the number of clauses that count additionally towards d(y), i.e., 

clauses D with y G vbl(D), \D\ = k and \D^\ =k-l. Clearly, D 6 r(C), otherwise its 
size would not decrease under a. For each x G vbl(C), there is at most one such clause 
D G r(C), since x,y € vbl(-D) and F is linear. Hence there are at most \C\ < k such 
clauses, thus d F[a \ (y) < d F (y) + k. Therefore, d(F^) < d(F) + k holds as well. □ 

Corollary 5.6. Let F be an unsatisfiable [I, k]- CNF for I > 2. There is a partial as- 
signment a such that F^ is an [I — 1, k]-CNF, d(F^) < d(F) + k, and a satisfies at 
least 

(2 k - 2 - kd(F)2 k - 1 ) 

clauses of F. 

Proof. By Lemma [5.2[ we know that if F is unsatisfiable, there is a clause C G F 
with 

l-W > - 



E 2- 



Der(C) 

Further, using Lemma l5.4| we can estimate 



-l 



2 HD| < \r{C)\2- k + kd(F)2 
Der(c) 

And thus, solving for |P(C)|, 

\r(C)\ > 2 k ~ 2 ~kd{F)2 k - 1 (5) 

Let xi, . . . , x\c\ be the variables of C. Since the r Xi (C) are pairwise disjoint, by the 
pigeonhole principle there is an Xi such that |-r ai (C)| < \r(C)\/l. Set a(xt) such that a 
satisfies C. For the remaining C — 1 variables Xj of C , set ct(xj) such that it satisfies 
at least \r xj \/2. Overall, we satisfy at least 

n ~V(c)| 



2\C\ 

clauses. Inequality © and the fact that C > I imply the lemma. □ 



Theorem 5.7. Let f(k) := min {\F\ | F is an unsatisfiable linear fc-CNF}. Thenf(k) G 
fl (k2 k ) . 

Proof. Let F be an unsatisfiable linear fc-CNF. We show that |F| G fl (k2 k ). Define 
Fi for < i < k — 1 as follows: F := F. For < i < k — 2, apply Corollary 15.61 on F t 
and let q;, be a partial assignment as described in the corollary. Define F i+ i = F^'. It 
follows that Fi is an unsatisfiable [k — i, fc]-CNF, d{Fi) < ik, and aj satisfies at least 



X 



clauses of F i: i.e., \Fi\ — \F i+ i\ > M(i). Hence, for any Q < j < k — 1, we obtain 




Plugging in for example j — k — 31ogfc yields the claimed bound of \F\ G SI (k2 k J. □ 
6 Small Unsatisfiable Linear 3-CNFs 

In this section, we construct small unsatisfiable linear 3-CNFs. However, we do not know 
the exact value of /(3). Consider the formula 

{{xi,X 2 }, {S 2 ,X 3 }, . . . , X„}, {x n , Xi}} . 

Every assignment satisfying it sets all Xi to 1 or all to 0. We use this formula as a 
gadget for building so-called forcers. A formula F is called a C -forcer if every assignment 
satisfying F satisfies C. Define 

^6 := {{xi,x 2 }, {x 2 ,x s }, {x3,x 4 }, {x 4 ,x 4 }, {xi,x 3 }, {x 2 ,x 4 } 

and note that it is unsatisfiable. For a clause {u,v,w}, define 

F 6 ({u,v,w}) := {{xi,x 2 ,u}, {x 2 ,x 3 ,v},{xs,x 4 ,u}, {x 4 ,x 1 ,v},{x 1 ,x 3 ,w},{x 2 ,x 4 ,w}} . 

This formula is a linear 3-CNF and a {u, v, u>}-forcer. A nice property of this forcer is 
that no variables in the forced clause occur together in one of the clauses of a forcer. 
Taking the union of the forcers Fq(C) for all 8 clauses C over {it, v, w} (and renaming 
the variables Xi each time, to ensure linearity), we obtain an unsatisfiable linear 3-CNF 
with 48 clauses. This is exactly the construction used in the proof of Theorem 13.21 

We can improve the above construction. Define 

F 8 ({u, v}) := {{xi, x 2 , u}, {x 2 , x 3 ,v}, {x 3 , x 4 ,u} 7 {x 4l x 5 , v}, {z 5 , x 6 , it}, {x 6 , x 4l v}, 
{xi,x 3 ,x 5 },{x 2 ,x 4 ,x 6 }} . 

This is ax {it, w}-forcer with 8 clauses. Building 4 forcers for the clauses {11, i>}, {it, v}, {u, v}, {u, v}, 
we obtain an unsatisfiable linear 3-CNF with 32 clauses. 



We go on: Consider {{u,v},{u,v}, {u,v},{u,v,w},{u,v,w}}. Clearly, this formula 
is unsatisfiable. Build the forcer F 8 (C) for the three 2-clauses. Then build F e ({u,v, w}). 
Finally, add {u, v, w}} and obtain an unsatisfiable linear 3-CNF with 3x8 + 6 + 1 = 31 
clauses. 



The trick here was that we can afford to enforce one clause C not by using F 6 (C), 
but by directly including it into our final formula. Of course, we must keep that final 
formula linear, hence we cannot apply this trick too often. However, we can tweak the 
formula such that we can apply this trick twice. Consider 



Fvi = {{u, v, w}, {u, v 7 w}, {v, w}} = {Cx, C 2 , C 3 } 
Fm = {{w, x, y}, {w, x, y}, {w, x}} = {D 1 ,D 2 ,D 3 } 



XI 



The formula F w is a {«;}-forcer, and is a {w}-forcer. We can build a linear formula 
from which F w and F^ can be derived: 

F := F 6 (C 2 ) U F 6 (D 2 ) U F 8 (C 3 ) U F 8 CD 3 ) U {Ci} U {£>i} 

Here, we applied the above trick of directly including a desired clause into the final 
formula twice, namely to C\ and D\ . This is an unsatisfiable linear 3-CNF with 6 + 6 + 
8 + 8 + 1 + 1 = 30 clauses. 

7 Conclusion 

We showed that the size of a smallest unsatisfiable linear fc-CNF is in fl (fc2 fc ) nO (fc 4 4 fc ) . 
However, the best constructive upper bound is a tower-like function. It is desirable to 
find a way to construct unsatisfiable fc-CNFs of reasonable size, since this will give much 
better insight into the structure of those formulas than a probabilistic proof. 

One approach would be to use hypergraph vertex coloring problems: If one translates 
the fc-colorability problem of a <i-uniform linear hypergraph into a CNF in the natural 
way, one obtains a linear CNF with clauses of size k and d. There are linear d-uniform 
hypergraphs with arbitrarily large chromatic number, for any d. This follows e.g. from 
the Hales- Jewett-Theorem [8] on combinatorial lines. However, the bounds obtained from 
this theorem are tower-like, too. 
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